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LINEAR STATISTICS OF THE CIRCULAR ^S-ENSEMBLE, 
STEIN’S METHOD, AND CIRCULAR DYSON BROWNIAN 

MOTION 

CHRISTIAN WEBB 


Abstract. We study the linear statistics of the circular /3-ensemble 
with a Stein’s method argument, where the exchangeable pair is gener¬ 
ated through circular Dyson Brownian motion. This generalizes previ¬ 
ous results obtained in such a way for the CUE and provides a novel 
approach for studying linear statistics of /3-ensembles. This approach 
allows studying simultaneously a collection of linear statistics whose 
number grows with the dimension of the ensemble. Also this approach 
requires estimating only low order moments of the linear statistics. 


1. Introduction 

The goal of this note is to study linear statistics of the circular /3-ensemble 
(which we will usually denote by C/3E or C/3E(n) if we wish to stress 
the dimension). More precisely, if (e“i, ...,6“") is a realization of the n- 
dimensional C/3E, we shall study the Wasserstein-1 distance of the law of 


( 1 ) 


to the law of 


Td = 



d 


k=l 


( 2 ) 



where Zj are i.i.d. standard complex Gaussians. 


Our main result will be that if d grows slowly enough with n, the distance 
goes to zero as n —>■ oo. Our approach will be to apply Stein’s method 
for which we shall generate an exchangeable pair through circular Dyson 
Brownian motion. The estimates one will then need to apply Stein’s method 
involve some low order moments of Td for which we can make use of results 
of [15]. 


The motivation for this approach comes from [11, 9], where a similar ap¬ 
proach is used for /3 = 2 (as well as the circular real ensemble and circular 
quaternion ensemble, i.e. the Haar measure on the orthogonal and symplec- 
tic groups), though the relevant dynamics is interpreted through the heat 
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kernel on the unitary group which does not generalize so obviously to other 
values of /3. 

While the fact that finite collections of such linear statistics converge 
jointly in law to independent Gaussians with suitable variances, is certainly 
known (e.g. the approach of [16] should be easily adapted to the circular 
case and more recently such a result is proven in [15] - for other work related 
to the linear statistics of the C/3E, see e.g. [23, 10]), what our approach offers 
is a rate of convergence (which is likely to be extremely far from the true 
one - in the case of CUE the rate is known to be superexponential, see [17] 
- which is much faster than the one our approach suggests) as well as a 
possibility to study the joint convergence of linear statistics whose number 
grows with n. Another benefit of this approach is that one only needs to 
estimate only rather few moments. To the author’s knowledge, such results 
aren’t known for C/3E. Moreover, this approach through Stein’s method 
coupled with Dyson Brownian motion has potential to be applied to other 
/3-ensembles. 

The outline of this note is the following: we begin by recalling the defi¬ 
nition of the C/3E and the relevant Wasserstein distance as well as stating 
our main result. Next we shall recall the approach in [9] for multivariate 
complex normal approximation, the definition of circular Dyson Brownian 
motion, and point out what the relevant estimates we shall need for applying 
Stein’s method to our case. These estimates involve the generator of circu¬ 
lar Dyson Brownian motion acting on certain power sums, which are simple 
to calculate exactly, along with moment bounds of power sums which can 
be estimated with results from [15]. Einally we point out as an application 
of the results of [15] a limit theorem for the logarithm of the characteristic 
polynomial of the C/3E. This is very similar to a result of [14] for the CUE. 

Acknowledgements: The author wants to thank two anonymous refer¬ 
ees for helpful comments about the article, as well as K. Kytola for useful 
discussions. This work was supported by the Academy of Einland. 

2. The circular j3 ensemble, The Wasserstein distance, and the 

MAIN result 

The purpose of this section is to state our main result and to do this, we 
recall the definition of the C;3E and the Wasserstein-1 distance. 

Definition 1. Let 

(3) A„ = {(xi, ..., Xn ) G [0, 271]” ■. Xi < X 2 < ... < Xn } 

and /3 > 0. The n-dimensional CjdE is the following probability measure on 


n! 


n 



3=1 


dxj 


( 4 ) 
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where the normalization constant is a Selberg integral and can he evaluated 
exactly: 


( 5 ) 



r(i + nf) 


Remark 2. We will often identify [0, 27r) with the unit circle and with 
a subset of the n-fold product of the unit circle with itself. 


The Wasserstein-1 distance is a metric on the space of random variables 
taking values in a fixed underlying space (which we’ll take to be Euclidean, 
but more general cases are possible) with finite first absolute moment. Con¬ 
vergence with respect to it is equivalent to convergence in law along with 
convergence of the first absolute moment. Let us recall its two equivalent 
definitions (see e.g. Chapter 6 in [24] for more information on Wasserstein 
distances): 

Definition 3. The Wasserstein-1 distance between the laws of two (or 
as we’ll actually be interested in) valued random variables - X and Y - 
is 


(6) >v(‘^^(A,y) =infE(|X-y|), 

where the infimum is over all couplings of X and Y. 

An equivalent definition for the metric (a result due to Kantorovich and 
Rubinstein) is given by 


Wi‘'\x, Y) = sup{ E(/(X)) - E(/(y))| f:R^^R, 

(7) \f{x) - f{y)\ < |x - y| for all x,y e M'’*}. 

We can now state our main result. 

Theorem 4. Let be drawn from the n-dimensional C/3E with /3 > 

2 

0, d= o{n7), 


( 8 ) 

and 


Td 



d 


k=l 



where Zj are i.i.d. standard complex Gaussians. Then 


( 10 ) 


as n ^ CO. 


w‘f\Td,Gd) = 0{d^/^/n) 
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Remark 5. As in [9], we could consider instead ofT^ a vector of the form 

( 11 ) ■ 

/ k=r 

where also r grows with n and one will get constraints on r and d for the 
vanishing of the Wasserstein distance with similar methods as those we use. 
For simplicity, we only consider the case ofTd- 

Remark 6. One can use this result to study linear statistics of functions on 
the unit circle with nice enough regularity by Fourier expanding them and 
applying our result. 

3. Stein’s method and circular Dyson Brownian motion 

We’ll give a short informal sketch of the Stein’s method argument for mul¬ 
tivariate normal approximation that will be relevant for us. For a detailed 
treatment, see e.g. [20]. After this, we shall state the precise theorem (that 
appears in [9]) that we shall make use of. Next we shall review the definition 
and some basic properties of circular Dyson Brownian motion and how it 
ties into our Stein’s method argument. 

3.1. Stein’s method. For simplicity we’ll consider the case of real normal 
variables (the complex one follows from this). Let us assume that S is a 
symmetric positive definite dx d matrix. We’ll denote by D a d-dimensional 
vector of i.i.d. standard Gaussians and by Ys we denote \/sy. 

We’ll also use the following notation: by (•,-)hs we denote the Hilbert- 
Schmidt inner product of two matrices 

(12) {A,B)ns = TT{AB*), 

where B* denotes the Hermitian conjugate of B. We’ll denote by || • ||hs the 
corresponding norm. 

We will then make use of the following facts (see [20]) 

Fact (Fact 1). A random d-dimensional vector X agrees in law with Is if 

(13) E ((Hess/(X), S)hs - {X, V/(X))) = 0 

for each f £ C'^(M'^) for which the above integrand is in (with respect to 
the randomness). Here Hess/ is the Hessian matrix of f, and the second 
inner product is the Euclidean inner product ofW^. 

Fact (Fact 2). If g & C°°(M'’*), then 

O 1 

(14) h{x) = Uog{x):= —{Eg{Vix-I VlXtYs)-Eg{Ys))dt 

Jo 

is a solution to the differential equation 
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(15) {x,Vh{x)) - (Hess h{x), S)hs = 9 {x) - EgiYs). 

Let us now assume that we have a random vector X for which we wish to 
show that the law of X is close to that of Is hr the sense of the Wasserstein 
distance, and let us further assume that we have another random vector X' 
on the same probability space as X and X' = X. Moreover, let us assume 
that 

(16) E{X' - X\X) = -AX + V, 

for some invertible deterministic matrix A and some random vector V. We’ll 
want to think of X' being close to X so that when for example Taylor 
expanding f{X') around X for some function /, we can ignore high enough 
order terms. Also we assume that 

(17) E((X'-X)(X'-X)^|X) = 2AS + M, 

where S is again our deterministic symmetric positive definite matrix and 
A'l is a random d x d-dimensional matrix. 

Let us fix some g G C'°°(M'^) and let / = Uog- Then as X = X' 

0 = ((A-i(X' - X),Xf{X') + V/(X))) 

(18) 

= ^E {{A-\X' - X),Xf{X') - V/(X))) + E ((A-HX' - X),Xf{X))) 

= Ie {{A-\X' - X),Ressf{X){X' - X))) + E {{A-\X' - X), V/(X))) 

+ ..., 

where we Taylor expanded Xf{X')—Xf{X) around X and ... denotes higher 
order terms in the expansion. Noting that 

(19) 

(A-i(X'-X),Hess/(X)(X'-X)) = {A-\X'- X)iX'- Xf ,Ressf{X))ns 
and conditioning on X, we find that 


0 = E(Hess/(X), E)hs - ]E((^, V/(X))) 

(20) + ^E{{A-^M, Hess/(X))Hs) + E((A-1 h, V/(X))) + ... 

From Fact 2, we then find 

( 21 ) 

E( 7 (X) - E 5 (yE) = ^1E((A-1 m, Hess/(X))Hs) + E((A-iy, V/(X))) + ... 
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As Hess/ and V/ are bounded, if we can control A~^M and A~^V (and 
the higher order terms), this suggests that we can control the Wasserstein 
distance. This is indeed the case. We’ll actually construct a one parameter 
family of the vectors X' through Dyson Brownian motion started from an 
independent C/IE realization and the closeness of X and X' will come from 
the t —)• 0 limit. Let us state the actual theorem for the Stein’s method 
argument in the following form (see Theorem 1.3 in [9] and Theorem 4 in 
[20] for proofs) 

Theorem 7 (Dobler and Stoltz, Meckes). Let W, Wt (fort > 0) beC’^ valued 
L^(P) random vectors on the same probability space (D,A., P) such that for 

each t > 0, (W,H/) = (H/,1T). Let Z £ be a d-dimensional random 
vector whose entries are i.i.d. standard complex Gaussians. Suppose that 
there exist non-random matrices A, S £ such that A is invertible and S 
is positive definite. Assume further that there exists a random vector R £ 
random matrices S,T £ and a deterministic function s : (0, oo) —)• M 

with the following properties 

(i) ^E (Wt - W\W) ‘4 °-AW + Rm L\¥) 
s{t) 

(ii) ^E {{Wt - W){Wt - iy)*|W) *4° 2AS + 5 in 4(11 • ||hs,P) 

s{t) 

(iii) ^E {{Wt - W){Wt - Wf\W) ‘4° T in 4(|| • ||hs,P) 

" ^W\Wt-W\^>e}) = 0 , 

for each e > 0. 

Then 


(22) W^^\W, 4SZ) < ||A-i||op (e\R\ + ^||S-^||opE(||5||HS + ||r||Hs)) , 
where || • jjop denotes the operator norm: for A £ 


(23) l4llop = sup \Ax\. 

x^‘C^:\x\ = l 

Remark 8. As noted in [9], we can replace the estimate for E(|1W — 
VTpl|Vi/j_w|>e) weaker one 


(24) 

since 


hm4-E|H/-WP = 0 

t -,0 s{t) ' ' 


n\wt - w|4|h.,-wi>.) < ln\wt - iHi'). 


( 25 ) 
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3.2. Circular Dyson Brownian motion. In this section we define cir¬ 
cular Dyson Brownian motion and point out how it ties into our Stein’s 
method argument. 

Circular Dyson Brownian motion was introduced by Dyson [8] and Dis¬ 
cussed for example in [23]. Its existence is proven in [4]. It is a model for 
diffusing particles confined to the unit circle and interacting with each other 
through a logarithmic repulsion. The main result of [4] is that one can make 
the following definition: 

Definition 9 (Circular Dyson Brownian motion). Let /3 > 0. n-dimensional 
Circular fd-Dyson Brownian motion is a (^([O, oo), A„) valued semimartin¬ 
gale (which we denote by {x{t))t>o = (xi(t), Xnit))t>o) which is the unique 
strong solution to the system of stochastic differential equations 

(26) dxj{t) = ^ cot r) — '^-^dt-\-V^dbjif), 

for j = l,...,n. Here bj are i.i.d. standard Brownian motions. 

Remark 10. It is proven in [4j, that for /3 > 1 the particles almost surely 
do not collide (so Xi { f ) / Xj { f ) for i j for all t), but for j 3 G (0, 1) they 
almost surely do. 

In the following remark we’ll informally recall some basic facts from dif¬ 
fusion theory applied to our setting. 

Remark 11. As we are dealing with continuous semimartingales, we can 
make use of Ito’s lemma, and general facts from diffusion theory hold. In 
particular, a simple application of ltd’s lemma implies that we have for some 
fixed x(0) G An and function f 


(27) 


[/(x(t))]=/(x(0))+E"(o) 


[Lfjf]{x{s))ds 


where denotes expectation with respect to the law of the process started 
from x(0), and can be viewed as the infinitesimal generator of the process: 


(28) 


Ls = 


k=l l^k 


Xk- Xl d 

2 dxk 


+ 


2 


2'EE 

k=ll^k 


^ixk _j_ ^ixi Q 

giXk _ Qixi 


n 


+E 


k=l 


94' 


As for fi <1 there can be collisions, there is some care to be taken about 
what the precise domain of the infinitesimal generator is (for example, if 
f e CffAn) is a function in the domain of the generator, then one must 
have that lima;^.^^-).^;^ cot[(xj+i — Xj)/2]{dj+i — dj)f is finite, or in other 
words, dj+if{x)\,,.^^=xj = djf{x)\xj+,^=a:^). 

From (27) we see that if pt{x; x{0)) is the density of the law of x{t) started 
at x(0), then it satisfies the equation 
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(29) dtpt{x,x{Q)) = L*ppt{x,x{{))), 

where L*^ is the adjoint of Lj^: 



This implies that the Cj5E is a stationary distribution for circular Dyson 
Brownian motion. To see this, note that for 

( 31 ) p{x) = Cn,pll\e^^^ 

j<k 

where is a normalization constant, and V{x) = — log |2sin f |, a simple 
calculation making use of the fact that lcot(x/2) = —V'[x) shows that 

(32) L}p = 0. 

Thus the unique solution to dtpt{x, x{0)) = L*^pt{x,x{0)) with initial data 
given by the CfdE: pq{x,x{0)) = p{x), is pt{x,x{0)) = p{x) - or the C(dE is 
a stationary distribution for circular Dyson Brownian motion. 

A similar argument shows that for g in the domain ofLp, L*p[gp] = [Lpg\p, 
i.e. that Lp is in fact self-adjoint on L‘^{p). Thus the Cj3E is a reversible 
measure for Dyson Brownian motion which implies that the pair {x{0),x{t)) 

is exchangeable (i.e. {x{0),x{t)) = {x{t),x{0))) for each t > 0. 

Let us now prove our main estimates required for applying Theorem 7. 
This entails estimating E(/(x(t))|x(0)), when / is a function relevant to 
Theorem 7. This will be done through estimates on Lpf for relevant /. For 
(3 = 2 [11, 9] make use of similar results for the heat kernel of the unitary 
group found in [21, 18] with a different kind of approach. 

Lemma 12. Let x(0) be distributed according to the C(3E and independent 
of {bj{t)). Also let k £ Z and write for x £ [0, 27r]”, Pk{x) = ■ 

Then 


(33) 

and 


1 r 


- E"^®[pfc(x(t))]-pfc(x(0)) 4 [Lppk\{x{t))) 


t - 


(34) LpPk = -n^\k\pk “ ( 1 f - ^\k\ ^ Psgn{k)lPsgn{k)(\k\-l), 

^ ^ 1=1 

where sgn{k) = k/\k\ for k and Lp is the operator from (28). 
Moreover, for k,l £ Z 
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(35) ^ E^^^'>[pk{x{t))pi{x{t))] - Pk{x{0))pi{x{0)) *4° [L^(pfcp/)](x(0)), 


and 


(36) 


LpiPkPi) = PkLpPi+piLppk - 2klpk+i. 


In both cases, the convergence is in with respect to the law of the C/3E. 

Proof. Let us first establish the claims about the action of L/j on pk and 
PkPi- We have from (28) 


, P ^ AUjkXr , 


(37) Lppk{x)= k^pk{x) + 


Then note that 


(38) 

n 

rn=l l^m 


^IXm gixi 

^iXm _ 


EEC'"+<=“')- 

m=l l^m 


^ikxm _ ^ikxi 


^IXm _ 


+ + 


^IXl', 


,,ikxi 


^ikxm _ gikxi 


y ^(e^- +e“0^ 

/ ^ ^ ^ ^ ^IXm — 

m=l l^m 


m=l //m 

- Sk- 


^iXm _ 


We thus conclude that 


(39) Lgpkix) = k^Pk{x) k EE(^“"+^“')^ 


^ikxm _ ^ikxi 


m=l Ij^m 


^IXm _ 


For k G Z+, we expand the difference quotient and hnd (using for example 
Pq{x) = n) 
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(40) 

LpPkix) 


khk{x) -jkY^ + e“0 E 


fc-i 


x,n„i{k-l-j)xi 


-k'^Pkix) - E + e*^*) E 


m=l l^m 
n n 


j =0 

k-1 


ijXmJ{k-l-j)xi 


+ -A: E 2e“"‘A:e* 


m=l i=l 

(fe-l) 3 ;„ 


i=o 


m=l 


1-4 

2 


1-4 

2 




k-1 


j =0 

k-l 


k^Pk{x) - ^k'^pj{x)pk-j{x) - ^knpkix), 


4=1 


which was the claim for k > 0. For A: = 0 the claim is clear, and for A: < 0 it 
follows by complex conjugating the k > 0 case. For calculating L/^[pkPi], we 
note that if we write A = X]J=i then in general for twice differentiable 
functions / and g one has 


(41) 


Mfg] = f^g + g^f + 2 E 

4=1 


—/ 

dxi 


E 

dx 


3 J 


The first order part of L_a satishes a normal product rule so we find 


(42) 


n 

L!j\pkPi\ = PkLpPi + PiLpPk + 2 E 

j = l 

r d 1 

7i~Pk 

[dxj J 

r d 1 

[dxj J 

n 

= pkLppi + piLppk + 2 E 

4=1 

r d 1 

[dxj J 

r d 1 

[dxj J 


= pkLgpi + piLppk - 2kl E 

4=1 

= PkLpPi + PiLppk - 2klpk+i 

which was the claim concerning the action of on PkPi- Let us now turn 
to the convergence part. From (27) we hnd that for any fixed x € 


(43) 

\pk{x{t))\ -Pkjx) 
t 


[Lgpklix) 


^ ^x Ioli^mK^js)) - [Lf3Pk]{x)\ ds 
~ t 


As we’ve seen that is a polynomial in the variables we see 

that sup^, |[Lppfc](x)| is a finite number depending on n and k. Thus the 
random variable we’re taking an expectation of on the right side of the 
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equation is uniformly bounded in t, and by the continuity of t eA x{t) at 
t = 0, it converges to zero almost surely as t —)• 0. Thus by the dominated 
convergence theorem (applied to the integral) we conclude that the left 
side of the equation tends to zero. The same argument implies that the left 
side of the equation is bounded in x by a constant depending only on n and 
k, if we integrate over x with respect to the law of the C/3E, we can apply 
the dominated convergence theorem again to achieve convergence with 
respect to the law of the C/3E. The argument for the convergence of the 
PfeP/-term is similar. 

□ 


4. Moment estimates of power sums for the C/3E 

Before checking the conditions for Theorem 7, we need some moment 
estimates on power sums. We need a simplified version of the main result in 
[15] (their results are analogous to those of [6] though extended to general 
(3 from the unitary case through Jack polynomial theory): 

Theorem 13 (Jiang and Matsumoto). Let 0 < m < n, 

(44) 


Then 

(45) E(|pm(a;)P) < B^m 

and for 0 < m < n with 0 < j,k < m, 


——1 

I p 

n— 771 + 


^l(/3 < 2) 


and B = \ 1 


— —1 

1/3 -^1 

77 — 777+ 


t1(/3>2) 


(46) 


\E{pj {x)pm-j {x)p-k {x)pk — 777 (^))l 


max{|A - 1|, |B - 1|} (^1^ 2^j{m - j)k{m 
b(|) 2j{m- j),k = j. 


k) 


k^j 


In most of our applications, we will have m = o{n) and this becomes 
Corollary 14. For 0 < m = o(n) and n large enough 


(47) E{\pm{x)\^) < 2|m 

and for 0 < j,k < m, 


( 48 ) 

\K{pj {x)pm-j {x)p-k ix)pk-m (ic)) I < 


yjj{m - j)k{m - k)0 (^) 

4 ( 1 ) Jim- j),k = j. 


k^j 
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Proof. This follows directly from the definition of A and B noting that for 
m = o(n) 


(49) 


A,B = 1 + 0 



□ 


5. Proof of Theorem 4 

We can now check the conditions required for Theorem 7 and make the 
estimates needed for the proof of Theorem 4. Let us begin by checking the 
conditions for Theorem 7. 

Let us write W = T^, x{t) = (xi(t), ...,Xn(t)) for the n-dimensional circu¬ 
lar /3-Dyson Brownian motion started from an independent C/3E(n) vector 
X = {xi, ...,Xn), and Wt = {pi{x{t)), ...,pd{x{t))). 

The first condition for Theorem 7 involved the conditional expectation of 
Wt given W: 

5.1. E(Wt — 1T|1T) as t 0. By Lemma 12, we have as t —)■ 0 

(50) hm h{Wt - 1P|1P) = {Lf,pi{x),...,L^pd{x)) 
t^o t 

(51) =-AW + R, 
where A G with entries 


(52) 

and R £ with entries 


Afc,i = 


5k,ink 


2 ’ 


(53) Rk = —k'^ 

Next we need E((Wt 
5.2. E{{Wt-W){Wt- 


- l) Pkix) - k^'^pi{x)pk-iix). 

^ ^ ^ 1=1 

- W){Wt - W)*\W) as t ^ 0. 
W)*\W) as t —)• 0. For this, we need 


(54) E{{pj{x{t)) - pj{x)){p-k{x{t)) - p-k{x))\x) 

for j,k G Z+. To calculate this, we expand the product and consider each 
term separately: 


(55) E{pj{x(t))p-kix{t))\x) = pj{x)p-k{x) + tLp{pj{x)p-k{x)) + o(t), 

(56) E{pj{x{t))p_k{x)\x) = pj{x)p-k{x) tp-k{x)Li 3 Pj{x) o{t), 
and 
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(57) E{pj{x)p-kix{t))\x) = pj{x)p-k{x) + tpj{x)Lj 3 p_kix) + o{t). 
Thus 

E{{pj{x{t)) - Pj{x)){p-k{x{t)) -p-k{x))\x) 

(58) =t{Li3{pj{x)p_k{x)) - pj{x)Li3p_k{x) -p_k{x)Li3Pj{x)) + o{t). 
Making use of Lemma 12, we find 

|™^E((pj(a;(f)) - pj{x)){p-k{x{t)) -p-k{x))\x) = 2jkpj-k{x). 
We then write this as 

(60) lim -EiiWt - W)(Wt - W)*\W) = 2AS + S, 
t^o t 

where S £ 

(61) (AS)fc_/ = 6k,ink‘^, 

or in other words 


(62) 


Si. / — -T^Sk }k. 

’ /3 ’ 


Moreover S € with entries 


(63) 


Sk,i = (1 - Sk,i)2klpk-i{x). 


5.3. E{{Wt - W){Wt - Wf\W) as t ^ 0. Here we need 


(64) 


E((pj(x(t)) - pj{x)){pk{x{t)) - pk{x))\x) 


and a similar argument yields 


I™ ^E((pj(x(t)) - Pj{x)){pk{x{t)) -pk{x))\x) = -2jkpj+k{x) 


or (again with convergence in L^) 


( 66 ) 


lim -E((Wt - W)(Wt - Wf\W) = T, 
t^o t 


with 


(67) 


Tjk = -2jkpj+k{x)- 
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5.4. E(|Wt — W\^) as i —)• 0. Following Remark 8, it is enough for us to 
estimate W^Wt — W\^ (which for a diffusion one would expect to behave as 

but we still outline an argument for checking it directly) which in turn 
we can bound from above by . Conditioning on 

W and using (59), one finds E|M4 — W\‘^ = 0{t) as t —)• 0 and using similar 
arguments (in particular, the fact Lfs{fg) = fLpg + gLpf + 2 Y^j{djf){djg) 
repeatedly) one finds W\Wt — W\^ = o{t), and 

(68) lim^E(|kFt - kF|3) = 0. 

5.5. The Wasserstein-1 distance. Thus the conditions for Theorem 7 are 
met (A is invertible and S positive definite) and we have 

(69) 

W^f{Td,V^Z) < l|A-i||op (e\R\ + ^||S-i||opE(||5||HS + IITIIhs)) , 

where Z a d-dimensional vector of i.i.d. standard complex Gaussians, || • ||op 
denotes the operator norm (with respect to the underlying Euclidean norm), 

I • I the Euclidean norm, and || • ||hs the Hilbert-Schmidt norm. Let us check 
what these quantities are. 

Recall that 


(70) Afe,; = 6k,ink^ 
and 

(71) 'Sk,i = Sk,i^k. 
Being diagonal matrices, we note that 

(72) l|A"^||op = 

and 


(73) ||E-5 

We’ll estimate E|ii| by y/'^k and recall that Rk consisted of two 

types of terms Rk = Ak + Bk for which we write \Rk\'^ < 2{\Ak\^ + \Bk\^) and 
estimate these separately. We use a similar estimate for the Hilbert-Schmidt 
norms. More precisely, recalling the definition of R, S', and T we have 

(74) _ 

d d k—1 

^A:4E|pfc(x)|2-K^/i:2 ^ E{pi{x)pk-i{x)p-j{x)pj_k{x)), 

k=l k=l l,j=l 



E|R| < C(/3) 
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(75) E||5||hs< 




\ j,k=i 


\ j,k=l 


and 


(76) 


EIITIIhs < 


d 


'i 


^ 4j2/c2E|pj+fc(x)|2. 

j,k=l 


We then make use of Corollary 14 to get bounds on these: 


Lemma 15. For d = 0{^/n) 


(77) E|i7| = 0(^3), 

(78) E||S||HS=o(ii^), 
and 

(79) E||r||HS = o(di). 
Proof. Plugging Corollary 14 into (74), we find 


(80) 


E|i7| < C(/3) 


d d Ik—1 , 

k=^ k=^ \ 1 = 1 


The first sum is of order d®. For the second sum, we note that 


k—1 k—1 k—1 

(81) j) =k^j= 0{k^). 

j=i j=i j=i 

For the third sum, we note that 


(82) 


E Vj{k-j)Kk -1) < 


'k-1 


2 


and 
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k-l 


k-1 




i=i 


(83) 


i=i 

k-l , . 

i=i 

= 0{k^). 


Thus 


(84) 

As k = O(y^) 


E|ii| < C 




d . 

d^ + y -kA 

^ n 


k=l 


(85) E\R\=0{d^). 

For S we find (plugging Corollary 14 into (75) and using similar arguments 
as for R) 


1E||5'||hs < J y 8pk^^(.k-j) 


( 86 ) 


< C. 






= 0 d2 


and in a similar manner 


(87) 


E||r||Hs = o(d^). 


□ 


Noting that y/EZ = Gd and recalling that ||A ^\\^ = 0{n ^), plugging 
this into (69) gives for d = o(n^/'^) 


( 88 ) w\;'\Td,Gd) = 0{£l^/n). 

and Theorem 4 is proven. 

6 . The logarithm of the characteristic polynomial of the C/3E 

One of the results proven in [14] is a limit theorem where they prove, using 
mainly results of [6], that in a suitable Sobolev space of distributions, the 
real and imaginary parts of the logarithm of the characteristic polynomial 
of the CUE converge jointly in law to a pair of log-correlated Gaussian 
fields (in fact they can be understood as a restriction of the two-dimensional 
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Gaussian Free Field restricted to the unit circle with a suitable convention 
for the ’’zero mode”). 

As the results in [15] generalize those of [6] to /3 / 2, one can prove 
a similar result for the characteristic polynomial of the C/3E, though the 
estimates aren’t quite as strong for the /3 / 2 case so one does not have 
quite as good a control on the roughness of the field - one needs to consider 
slightly larger Sobolev spaces than in the f3 = 2 case. We’ll give a brief 
argument for a proof of this fact here. First we recall the definition of the 
relevant Sobolev spaces. 

Definition 16. For s & M let 


( 89 ) 'Hs — I {fk)k&z ■ ^ l/fcP(l + > 

I fcez J 

and equip it with the inner product (we write f = {fk)kGZ o,nd g = {gk)k£z) 

(90) if, g)s = J]](l + k'^Yfkgl- 

fcGZ 

With this inner product, Ids is a separable Hilbert space. We denote by 
II • ||s the corresponding norm. 

Remark 17. For s > 0, Hg can be interpreted as a subspace of the square 
integrable functions on the unit circle with s describing the degree of smooth¬ 
ness of the functions. For s < Q, Hg can be interpreted as the dual space of 
H-g so it is a space of distributions. The quantities fk are interpreted as 
the Fourier coefficients of a function (or distribution) f. 

We then define our limiting object: 

Definition 18. Let {Zj)^^ be i.i.d. standard complex Gaussians and write 
formally 

.. oo .. 

(91) X{e) = - 

Remark 19. One can check that for any e > 0, the above series converges 
almost surely in H-e so X can be understood as an element of H-g. 

We can now state the relevant limit theorem whose proof is essentially 
identical to that in [14]. A similar argument also appears in [13] so we give 
only a brief proof. 

Theorem 20. Let s > 1/2, > 0, and 

n 

(92) p„(0) = ]^(i_e*(-.-^)), 

where is distributed according to the C/3E(n). Moreover, let 
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(93) Xr,{e) = Re log Pn{e) and y„(0) = ImlogP„(0), 

where the branch of log is such that Imlog(l — € (-7r/2,7r/2] for 

all j. 

Then {Xn, Yn) converges in law in TLs xTl-s to (y^2//3X, sf^lJ^Y) where 
X is the field defined above and 

^ OO 

(94) Y{e) = -J2 

Proof. Following [14], we begin with the remark that expanding the loga¬ 
rithm gives (as an element of P-f) 

OO - / n \ OO - 

(95) logPniO) = - E - E = - E 

j=i ^ \k=i / j=i ^ 

This implies that 



and 



Thus in the Fourier basis, we have convergence in the sense of finite 
dimensional distributions (as [15] or Theorem 4 imply the convergence of 
say finite collections of the Fourier coefficients). 

By Prokohorov’s theorem, to prove convergence it is then enough to prove 
tightness. For this, one uses the fact that the unit ball in TL-s' is compact in 
TL-s for 0 < s' < s. Let us then note that by Theorem 13, if we take some 
small e G (0,1), there exists a constant C such that for 0 < j < (1 — e)n, 
E|pj(x)p < Cj while for j > {1 — e)n we trivially have K\pj(x)\‘^ < n^. 

Thus picking s' G (1/2, s) we have 

CX) - 

=Ex2(l+/)"^'lE|P,(a:)P 

i=i ^ 

(98) < C E jl+2s' E j2+2s' ■ 

This is bounded as the first sum converges as n —)■ oo and the second one is 
). A similar bound holds for Y^. Tightness then follows from the 
compactness of the unit ball mentioned above, and Markov’s inequality. □ 
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An interesting question is could one use stronger results on the linear 
statistics to give a stronger sense for this convergence. Here we essentially 
only used convergence of finite collections of the linear statistics and in no 
way made use of the fact that the number of them may grow with n. If one 
were able to extend d in Theorem 4 from to something close to re, it 

seems conceivable that one could estimate for example the distance of the 
maximum of the field A„ to the maximum of the truncation of the held X. 
Indeed, the super exponential rate of convergence for a single linear statistic 
proven in e.g. [17] suggests that our bounds are likely to be far from optimal 
so perhaps something like this could be possible. 

This could be one way to try to prove a conjecture of Fyodorov and Keat¬ 
ing in [12] (for /3 = 2), where they conjectured that the maximum of the 
logarithm of the characteristic polynomial of a CUE matrix behaves essen¬ 
tially like the maximum of a log-correlated Gaussian held (see e.g. [19, 7]). 
Another motivation for trying to improve this type of results would be to 
make better sense of the connection between random matrix theory and 
Gaussian Multiplicative Chaos (for a result in this direction, see [25] based 
on results in [5, 3], and for a review and an elementary approach to Gaussian 
Multiplicative Chaos see [22, 2]). Currently proving such results relies heav¬ 
ily on the determinantal structure and Riemann-Hilbert arguments available 
only for /3 = 2. 
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